Tagging Heterogeneous Evaluation Corpora for Opinionated Tasks
Authors
Abstract
Opinion retrieval aims to determine whether a document is positive, neutral, or negative on a given topic. Opinion extraction further identifies the supportive and non-supportive evidence in a document. Evaluating the performance of technologies for opinionated tasks requires a suitable corpus. This paper defines the annotations for opinionated materials. Heterogeneous experimental materials are annotated, and the agreement among annotators is analyzed. How humans monitor opinions as a whole is also examined. The corpus can be used for opinion extraction, opinion summarization, opinion tracking, and opinionated question answering.
Similar resources
Sentence Level Subjectivity and Sentiment Analysis Experiments in NTCIR-7 MOAT Challenge
This paper describes our supervised approach to the opinionated and the polarity subtasks in the NTCIR-7 MOAT Challenge. We apply a sequential tagging approach at the token level and use the learned token labels in the sentence level classification tasks. In our formal run submissions, we utilized SVM in both tasks with syntactic and lexicon-based features. Additionally, we present our experime...
Joint Chinese Word Segmentation and POS Tagging on Heterogeneous Annotated Corpora with Multiple Task Learning
Chinese word segmentation and part-of-speech tagging (S&T) are fundamental steps for more advanced Chinese language processing tasks. Recently, exploiting heterogeneous annotation corpora for Chinese S&T has attracted growing research interest. In this paper, we propose a unified model for Chinese S&T with heterogeneous annotation corpora. We first automatically construct a loose and u...
NEUOM: Identifying Opinionated Sentences in Chinese and English Text
Zhang, Ke Wang, Muhua Zhu, Tong Xiao, Jingbo Zhu. Natural Language Processing Lab, Northeastern University. {zhangcl, xiaotong, zhujingbo}@mail.neu.edu.cn. This paper introduces our NEUOM system which participates in the opinionated sentence detection task, one of evaluation task...
Reducing Approximation and Estimation Errors for Chinese Lexical Processing with Heterogeneous Annotations
We address the issue of consuming heterogeneous annotation data for Chinese word segmentation and part-of-speech tagging. We empirically analyze the diversity between two representative corpora, i.e. Penn Chinese Treebank (CTB) and PKU’s People’s Daily (PPD), on manually mapped data, and show that their linguistic annotations are systematically different and highly compatible. The analysis is f...
How to Evaluate Opinionated Keyphrase Extraction?
Evaluation is often a key issue in semantics- or subjectivity-related tasks. Here we discuss the difficulties of evaluating opinionated keyphrase extraction. We present our method to reduce the subjectivity of the task and to ease the evaluation process, and we also compare the results of human and machine-based evaluation.
Publication date: 2006